Strong cooperativity and inhibitory effects in DNA multi-looping processes 
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We show the existence of a high interrelation between the different loops that may appear in a 
DNA segment. Conformational changes in a chain segment caused by the formation of a particular 
loop may either promote or prevent the appearance of another. The underlying loop selection 
mechanism is analyzed by means of a Hamiltonian model from which the looping free energy and 
the corresponding repression level can be computed. We show significant differences between the 
probability of single and multiple loop formation. The consequences that these collective effects 
might have on gene regulation processes are outlined. 
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Loop formation in DNA complexes has been identified 
as a fundamental mechanism in gene regulation processes 
P) S B 0) Operators for DNA-protein interaction 
modify their relative positions through the formation of 
loops and thereby may operate even if they are not physi- 
cally close together. This mechanical process is governed 
by the physical properties of the DNA and the concentra- 
tion of proteins and has a deep impact on gene synthesis 
processes. The fixation of operators combined with pro- 
tein concentrations is responsible for control processes 
inside the cell. 

Elastic models have been proposed for the study of the 
physical properties of the DNA chain and the emerging 
phenomena like cyclisation and looping 0, 0] , and are the 
basis for large scale simulations of protein complexes Q . 
Within this approach, the elasticity of the bonds between 
the nucleotid bases determine the physical properties of 
DNA through its degrees of freedom. An important step 
toward the understanding of looping phenomena within 
a physical context was given in where the effect of 
protein concentration was related to multiprotein bond- 
ing positions. An induced phase transition to the loop 
phase is controlled by the protein concentration. Fol- 
lowing this physical analysis, a model of loop formation 
has been proposed using ideas from statistical mechanics 
which provides a clear picture of the connection between 
the protein concentrations, the free energy involved in 
loop formation and protein binding, as well as the 
structure of the DNA. The transition between the loop 
formation phase was reported for the case of a single loop 
and multiple proteins. 

In this Letter, we show that the loop selection process 
is the result of a strong competition between the different 
types of loops that can be formed in the same DNA frag- 
ment. These loops may appear in DNA segments with 
several binding configurations, but also in single loop con- 
figurations with the possibility of different spatial dispo- 
sitions of the looped segment [lO| . Loop formation entails 
changes in the structure of the DNA chain which allow 
distal operators to come into range of a binding protein 
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FIG. 1: (a) Loops are formed in a DNA segment by pro- 
tein binding. The DNA chain is looped by the interaction of 
binded protein monomers fixed at corresponding distant sites. 
We track this binding using the binary variables au,i,OD.i- 
OLf. marks the formation of the loop k. (b) Different loops 
may appear due to alternate protein configurations or DNA 
spatial disposition. 



(see Fig.l). However, in a scenario where multiple loops 
may appear, these conformational changes can hamper 
or even promote additional loop creation once the ap- 
pearance of a loop has modified the conditions necessary 
for the formation of additional loops. The possibility of 
formation of multiple loops becomes manifest through an 
effective interaction between loops that may for instance 
affect their size 

We focus on the formation of competing types of loops 
in a segment of DNA assuming that only one loop may 
be present at the same time in the segment. The con- 
ditions necessary for the formation of a loop are either 
geometrical, where the required operators have been set 
in positions that are incompatible with additional loop 
formation, or energetic, where the energy to form an- 
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other loop is not strong enough to undo an existing loop. 
In general, the most energetically favorable loops will be 
dominant; however, other loops may also emerge due to 
the interaction of the proteins binded to the chain during 
loop formation. As a result, a conformational interaction 
is induced between potential loops. 

Loop formation due to the binding of multiple proteins 
can be put in a statistical mechanics language by means 
of a Hamiltonian model which reflects the successive steps 
intervening in the process 0. In a DNA segment with 
2N binding positions with M different loops, the corre- 
sponding Hamiltonian can be written as 
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Here the set of binary variables iTL,fc (=0,1) accounts 
for the formation of a type Lk loop, and the variables 
au,i and aD,i indicate the binding of a protein monomer 
at the corresponding position (see Fig.l). The contri- 
butions to the free energy for the formation of a loop 
are introduced through the coefficients Ck (which are in- 
dependent of the chain length), while the coefficients 
e/c on the other hand, multiply the number of dimers 
au,iO'D,i contributing to loop formation which can be a 
function of the chain length. Different types of loops may 
carry different values of Ck and e^. The coefficients gu,i 
and gD,i are associated to the contributions of binding 
a monomer to the chain. Throughout this work we set 
9u.i = 9D.i = 9 = 9o ^ where the protein con- 

centration n is introduced in the Hamiltonian, and the 
binding contribution g is site independent. 

Two-loop interaction- We focus our analysis on the 
case M = 2 which shows the basic features of loop in- 
teractions. An additional study of cases with M > 2 
has revealed the absence of important differences in the 
loop selection mechanism. Changes in the chain due to 
the formation of a loop Li modify the conditions under 
which another potential configuration of a looped phase 
L2 may emerge. This situation can be found in short 
chains where the deformation of the DNA after the for- 
mation of a loop alters the distance and possible contact 
between distal monomers. We then envisage a scenario 
where loops of different free energies of formation com- 
pete. Once one of the loops is formed, there is no room 
for others. This restriction can be mathematically ex- 
pressed as 
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By using Montecarlo methods, the probabilities 

Pioop{Lk) = {cTLk) and Pbound = {(^u/D,i) can be com- 





FIG. 2: (a) Loop formation probability for two coexisting 
loops with A'^ = 30, ci =4,2 kcal/mol, ei = —8 kcal/mol, 
C2 = 3 kcal/mol and e2 = —7, 9 kcal/mol. The solid lines 
show the probability for the case in which only one loop can 
be formed while the marks are the result for the case of two 
loops. The dashed line shows the total probability of the 
looped phase, (b) Binding probability of a protein monomer 
in the DNA, in the same conditions. These proteins con- 
tribute to the formation of the two loops. The solid lines 
show the binding probability for the one-loop case. 



puted from the resulting equilibrium states. Analyti- 
cal results can be obtained for the single-loop scenario, 
and are used as a reference for the multiple-loop results 
shown here. The protein concentration n is the order 
parameter which describes the transition between the 
looped/unlooped phases Q of the different L^. To ana- 
lyze this transition, we deal with M -\-2 body interactions 
corresponding to the interaction of two operator sites to 
form one loop and the restriction imposed over the M 
loops. Adding the restriction Eq[21 we study the val- 
ues Pioop{Li) and Pioop{L2) for a chain with two possible 
loops with different free energy contributions. 

We start by studying the formation of two loops Li and 
L2 in a chain with N = 30, ci = 4, 2 kcal/mol, ei = —8 
kcal/mol, C2 = 3 kcal/mol and 62 = —7,9 kcal/mol. We 
set 5o = —7.2 kcal/mol and /3~^ — 0.6 kcal/mol in all 
our computations. In Figl^l we show the results for the 
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probabilities of loop formation (top) and the probability 
of binding a monomer (bottom). The solid lines repre- 
sent the expected values of the probability of single-loop 
formation in the absence of interaction, taken from [9|. 
The formation of multiple loops can be analyzed simi- 
larly through that of a single-loop with an effective in- 
teraction. The marks show the corresponding results of 
the Montecarlo simulation. 

Under these conditions, one of the two loops appears 
only in a small range of the protein concentration n. 
Thus, the activity of the cell processes associated with 
the formation of this loop is restricted to this range of 
concentrations, making induced loop interaction a mech- 
anism for gene control inside the cell. This behavior is 
produced by the two different contributions to the free 
energy of the loop formation, given through the term Ck, 
independent of the chain size, and the term Ck which de- 
pends on the number of protein dimers present in the 
chain. This contribution depends on the protein con- 
centration n inside the cell, becoming greater for higher 
values of n. Thus, a loop may become dominant at low n 
due to a dominant constant contribution Ck- By increas- 
ing the protein concentration, the free energy contribu- 
tion of term becomes dominant due to the formation 
and binding of more dimers contributing to loop forma- 
tion. This mechanism changes the corresponding loop 
probability of the different types of loops (see FiglSJ^a)). 
The binding probability of the monomers gets contribu- 
tions from the two forming loops, thus becoming the basic 
mechanism behind loop interaction. In Figlllb), Ptound 
is equal to that of the dominant loop for high n, while for 
n ~ 3 — 4, it receives contributions from the two loops. 

We have extended the interaction study to a range of 
values of ei and 62 for which the coexisting loop picture 
goes from an equiprobable disposition of both loops (fix- 
ing Ci — C2) to a situation where one of the loops dom- 
inates. For 62 = El + Ae, with increasing Ae, we iden- 
tify the transition region where the probability Pioop{L2) 
is zero for high protein concentrations. The results are 
shown in FigO This transition depends nontrivially on 
the respective values of ei and Ae and the protein concen- 
tration n and shows a progressive inhibition of L2 forma- 
tion for increasing Ae. The L2 formation is restricted to 
a progressively narrow range of values of n, making this 
mechanism a way to activate some cell processes for very 
particular protein concentrations. As explained above, 
this fact is a consequence of the dimer formation that 
contributes to the formation of the loop. The dimer con- 
centration increases with n which can be interpreted as 
the contribution to the free energy of the dimer forma- 
tion. 

We will now analyze the case of two loops with ei = 62 
and ci = C2 + Ac. The transition in this case is driven by 
a constant contribution to the Hamiltonian independent 
of n. In FiglH we show Pioop for different values of Ac. 
As expected, for high values of n there is no variation in 




FIG. 3: Loop formation probability Pioop for two coexist- 
ing loops with A'' = 30, ci = C2 = 3 kcal/mol, ei = —8 
kcal/mol and 62 = —7.9 kcal/mol (a), —7.95 kcal/mol (b), 
-7.97 kcal/mol (c), -7.98 kcal/mol (d). The solid lines show 
the results for the one-loop case, while the marks show the 
results of the Montecarlo simulation. 



Pioop after the transition, resulting in the same relative 
probabilities for the two loops at different protein con- 
centrations. This behavior is of a completely different 
nature from that shown in Fig [3] and can be interpreted 
as the contribution of the different structures of DNA 
to the free energy. This situation may appear in loops 
with different potential physical dispositions, with differ- 
ent values of Cfc, but formed in equivalent conditions of 
dimer bonding. 

Repression level- We have computed the probabilities 
of two types of loops Li and L2 in a single DNA chain. In 
physiological conditions where the looped phase is associ- 
ated with the repression of a gene {i.e. the lac operon in 
E.coli [III), an effective looped phase probability Ph^fj 
can be computed. Under some conditions this probability 
is higher than the respective probabilities in the single- 
loop case (see Fig.2(a)): Pioop{Li) and Pioop{L2)- The 
probability of a sate is determined by its standard free 
energy Hk through P^ oc g-^^/RT ^ normahzed by the 
probabilities of all the possible configurations. Thus the 
effective free energy of the looped phase Hb^ff satisfies 



(3) 



where H^^ and H^^ arc the corresponding free energies 
of the looped phases Li and L2. 

The contributions to the free energy of the DNA 
molecule can be identified with the repression levels 
[13I 14 1 . The free energy of the DNA molecule can be 
computed from the different contributions of binding and 
loop formation. Hence we can connect this physical in- 
terpretation with the experimental measurements of the 
repression levels. Taking into consideration the lac re- 
pression mechanism, the repression level Rioop with a 
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FIG. 4: Loop formation probability Pioop for two coexisting 
loops with ei = 62 = —8 kcal/mol, ci — 3 kcal/mol and 
C2 — 3.3 kcal/mol (a), 3.6 kcal/mol (b), 3.9 kcal/mol(c). The 
solid lines show the single- loop values (in isolated conditions), 
while the marks show the result of the Montecarlo simulation. 



single looped phase is given by 

Rioop = 1 + [[N] + e-«-i/^^) . (4) 

The repression level in loop interaction conditions Rioop, 
considering the effective loop free energy contribution 
and Eq.(3), satisfies 



Rioop ^ R 



loop- 



(5) 



The repression of transcription induced by the loop for- 
mation, in situations where multiple loop formation can 
appear, is affected by the corresponding conditions of 



protein concentrations and loop properties. Repression 
levels in the single-loop scenario have been reported in 

Bi. 

Conclusions- We have shown the presence of strong 
correlations between the different loops that can be 
formed in a given DNA segment. Geometrical changes 
in the chain, caused by the formation of a loop, can al- 
ter the conditions under which another loop may come 
up, thereby implying modifications of the loop formation 
probability and consequently of their statistical proper- 
ties. These correlations can give rise to cooperative ef- 
fects for which loops may appear under otherwise forbid- 
den conditions and to inhibitory effects hampering the 
loop formation under apparently favorable conditions. 
The loop interrelation effect can be quantified through 
an effective free energy which can be computed from a 
Hamiltonian that incorporates all the energies coming 
into play in the process. These collective effects can be 
adapted to a wide combination of physical conditions in- 
side the cell, where small changes of the protein con- 
centrations can dramatically alter the cellular processes 
controlling the repression level. The implications that 
loop collective effects may have in gene regulation pro- 
cesses can then be studied from measurable quantities 
establishing a clear connection between the repression 
level and the possible loop configurations in a fragment 
of DNA. 



[1 
[2: 

[3] 
[4] 

[6: 

[7 

[9: 
[10: 

[11 

[12 
[13 
[14 



M. Ptashne, A Genetic Switch: Phage A revisited, Cold 
Spring Harbor Laboratory (2004). 

J. MiiUer, S. Oehler and B. MiiUer-HiU, J. Mol. Biol. 257, 
21-29 (1996). 

N.A. Becker, J.D. Kahn and L. James Maher III, J. Mol. 
Biol. 349, 716-730 (2005). 

T.E. Cloutier and J. Widom, Molecular Cell 14, 355-362 
(2004). 

J.M.G. Vilar and L. Saiz, Current Opinion in Genetics 

and Development 15, 136-144 (2005). 

R. Metzler, T. Ambjornsson, A. Hanke, Y. Zhang and S. 

Levene, J. Comput. Theoret. Nanoscience 4, 1 (2007). 

P. K. Purohit and P. C. Nelson, Phys. Rev. E 74, 061907 

(2006). 

E. Villa, A. Balaeff and K. Schuhen, PNAS 102, 6783- 
6788 (2005). 

J.M.G. Vilar and L. Saiz, Phys. Rev. Lett. 96, 238103 
(2006). 

O.K. Wong, M. Guthold, D.A. Erie and J. Gelles, PLos 
Biol 6(9):e232 (2008). 

S. Sankararaman and J.F. Marko, Phys. Rev. Lett. 95, 
078104 (2005). 

B. MiiUer-HiU, The lac Operon: A Short History of a 
Genetic Paradigm, Walter de Gruyter, Berlin (1996). 
J. M. G. Vilar and S. Leibler, J. Mol. Biol. 331, 981-989 
(2003). 

L. Saiz, J.M. Rubi and J.M.G. Vilar, PNAS 102, 17642- 
17645 (2005). 



